Six approaches to limited domain concatenative speech synthesis
Abstract
This paper (based on an MS thesis by Robert Utama in the Electrical and Computer Engineering department at Rutgers University) describes six limited-domain text-to-speech (TTS) systems constrained to the digit-string and natural-number domains (cardinal numbers only). Each of the six unit-selection-based concatenative TTS systems was implemented in MATLAB. We evaluate and discuss several factors that influenced the naturalness or overall quality of the synthesized voice, including the length and type of the synthesis unit and the extent of co-articulation represented in the recorded speech database. We show that a high-quality limited-domain TTS system can be built either with maximal co-articulation effects or with carefully controlled minimal ones.
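To make the idea of word-level concatenation in the digit-string domain concrete, here is a minimal sketch in Python. It is not the authors' MATLAB implementation: the unit inventory `DIGIT_UNITS`, the function names, the in-memory `database` of waveforms, and the linear crossfade at each join are all assumptions chosen for illustration, and the abstract does not say how the six systems actually join their units.

```python
from typing import Dict, List

# Hypothetical word-unit inventory for the digit-string domain (an assumption).
DIGIT_UNITS: Dict[str, str] = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def digit_string_to_units(text: str) -> List[str]:
    """Map each digit character to the name of a recorded word unit.

    Characters that are not ASCII digits (spaces, dashes) are skipped.
    """
    return [DIGIT_UNITS[ch] for ch in text if ch in DIGIT_UNITS]


def crossfade_concat(waves: List[List[float]], overlap: int) -> List[float]:
    """Join waveforms, linearly crossfading up to `overlap` samples per join."""
    out: List[float] = []
    for wave in waves:
        # Never overlap more samples than either side actually has.
        n = min(overlap, len(out), len(wave))
        start = len(out) - n
        for i in range(n):
            w = (i + 1) / (n + 1)  # weight of the incoming unit
            out[start + i] = (1.0 - w) * out[start + i] + w * wave[i]
        out.extend(wave[n:])
    return out


def synthesize(text: str, database: Dict[str, List[float]],
               overlap: int = 0) -> List[float]:
    """Look up one recorded waveform per digit and concatenate them."""
    units = digit_string_to_units(text)
    return crossfade_concat([database[name] for name in units], overlap)
```

With `overlap=0` the units are simply butted together; a small positive overlap smooths each join at the cost of shortening the output by that many samples per join. A real system would load recorded audio into `database` rather than use hand-written sample lists.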
Similar resources
Introduction to multilingual corpus-based concatenative speech synthesis
This tutorial paper addresses foreign-language support in corpus-based concatenative text-to-speech systems. We give an overview of application domains where strictly monolingual speech synthesis is not sufficient and where multilingual text-to-speech is required or highly desirable. We describe two approaches to multilingual corpus-based speech synthesis: phoneme mapping on the one hand, and t...
Spectral smoothing for concatenative speech synthesis
This paper addresses the topic of performing effective concatenative speech synthesis with a limited database by proposing methods to smooth the transitions between speech segments. The objective is to produce natural-sounding speech via segment concatenation when formants and other spectral features do not align properly. We propose several methods for adjusting the spectra between waveform segm...
Stages and method of preparing syllable and diphone speech databases for a Persian text-to-speech system
Speech databases are part of concatenative text-to-speech synthesis systems. The phonetic quality of a database plays a significant role in the naturalness of the synthesized speech. This paper introduces two speech databases for Persian, one syllable-based and one diphone-based, and examines how they were developed, their specifications, and their relative advantages. ...
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis gives computers the human ability to produce speech. Data-based and process-based approaches are the two main approaches to speech synthesis, and each has its own challenges. Unit-selection speech synthesis and statistical parametr...
Limitations to concatenative speech synthesis
This paper discusses techniques for determining the linguistic needs for open-domain synthesis by concatenative methods, and reports on the design and evaluation of a tool for collecting and balancing a speech corpus automatically, in order to ensure optimal coverage of the sounds required for synthesis within a given task-domain. Synthetically generated utterances are used to prompt speakers, a...